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Related Application 

The subject matter of this application is related to the subject matter in a 
co-pending non-provisional application by the same inventors as the instant 
application and filed on the same day as the instant application entitled, "Applying 
Term Consistency to an Equality Constrained Interval Global Optimization 
Problem," having serial number TO BE ASSIGNED, and filing date TO BE 
ASSIGNED (Attorney Docket No. SUN-P6445-SPL). 

BACKGROUND 

Field of the Invention 

The present invention relates to performing arithmetic operations on 
interval operands within a computer system. More specifically, the present 
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invention relates to a method and an apparatus for using a computer system to 
solve a global optimization problem including inequality constraints with interval 
arithmetic and term consistency. 



Related Art 

Rapid advances in computing technology make it possible to perform 
trillions of computational operations each second. This tremendous 
computational speed makes it practical to perform computationally intensive tasks 
as diverse as predicting the weather and optimizing the design of an aircraft 
engine. Such computational tasks are typically performed using machine- 
representable floating-point numbers to approximate values of real numbers. (For 
example, see the Institute of Electriical and Electronics Engineers (IEEE) standard 
754 for binary floating-point numbers.) 

In spite of their limitations, floating-point numbers are generally used to 
perform most computational tasks. 

One limitation is that machine-representable floating-point numbers have a 
fixed-size word length, which limits their accuracy. Note that a floating-point 
number is typically encoded using a 32, 64 or 128-bit binary number, which 
means that there are only 2^^, 2^^ or 2^^^ possible symbols that can be used to 
20 specify a floating-point number. Hence, most real number values can only be 
approximated with a corresponding floating-point number. This creates 
estimation errors that can be magnified through even a few computations, thereby 
adversely affecting the accuracy of a computation. 

A related limitation is that floating-point numbers contain no information 
25 about their accuracy. Most measured data values include some amount of error 
that arises from the measurement process itself This error can often be quantified 
as an accuracy parameter, which can subsequently be used to determine the 
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accuracy of a computation. However, floating-point numbers are not designed to 
keep track of accuracy information, whether from input data measurement errors 
or machine rounding errors. Hence, it is not possible to determine the accuracy of 
a computation by merely examining the floating-point number that results from 
5 the computation. 

Interval arithmetic has been developed to solve the above-described 
problems. Interval arithmetic represents numbers as intervals specified by a first 
(left) endpoint and a second (right) endpoint. For example, the interval [a, b], 
where a < Z?, is a closed, bounded subset of the real numbers, R, which includes a 

10 and b as well as all real numbers between amdb. Arithmetic operations on 
interval operands (interval arithmetic) are defined so that interval results always 
contain the entire set of possible values. The result is a mathematical system for 
rigorously bounding numerical errors from all sources, including measurement 
data errors, machine rounding errors and their interactions. (Note that the first 

1 5 endpoint normally contains the "infmium", which is the largest number that is less 
than or equal to each of a given set of real numbers. Similarly, the second 
endpoint normally contains the "supremum", which is the smallest number that is 
greater than or equal to each of the given set of real numbers.) 

One commonly performed computational operation is to perform 

20 inequality constrained global optunization to find a global minimum of a 

nonlinear objective functionXx), subject to nonlinear inequality constraints of the 
form pI\) < 0 (/=1 . .,m). This can be accomplished using any members of a set 
of criteria to delete boxes, or parts of boxes that either fail to satisfy one or more 
inequality constraints, or cannot contain the global minimum/^ given the 

25 inequality constraints are all satisfied. The set of criteria includes: 

(1) the/_&ar-criterion, wherein \ff_bar is the smallest upper bound so 
far computed on / within the feasible region defined by the inequality constraints. 
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then any point x for which/x) >fj)ar can be deleted. Similarly, any box X can 
be deleted if /w/(/(X)) >fjar\ 

(2) the monotonicity criterion, wherein if g(x) is the gradient of f 
evaluated at a feasible point x for which ail p^x) < 0 {i=\,...,m\ then any such 

5 feasible point x for which g(x) ^ 0 can be deleted. Similarly, any feasible box X 
can be deleted if 0 C g(X); 

(3) the convexity criterion, wherein if H^^{x) is the z-th diagonal 
element of the Hessian of/ then any feasible point x for all which all piix) < 0 
(/=!,. and //«(x) < 0 (for /=!,.. can be deleted. Similarly, any feasible 

10 box X can be deleted if //„(X) < 0 (for /=1,. . and 

(4) the stationary point criterion, wherein any point x is deleted using 
the interval Newton technique to solve the John conditions. (The John conditions 
are described in "Global Optimization Using Interval Analysis" by Eldon R. 
Hansen, Marcel Dekker, Inc., 1992.) 

1 5 All of these criteria work best "in the small" when the objective function/ 

is approximately linear or quadratic and "active" constraints are approximately 
linear. An active constraint is one that is zero at a solution point. For large 
intervals containing multiple local rainima and maxima none of the criteria will 
succeed in deleting much of a given box. In this case the box is split into two or 

20 more sub-boxes, which are then processed independently. By this mechanism all 
the inequality constrained global minima of a nonlinear objective function can be 
found. 

One problem is applying this procedure to large /^-dimensional interval 
vectors (or boxes) that contain multiple local minima. In this case, the process of 
25 splitting in ^-dimensions can lead to exponential growth in the number of boxes 
to process. 
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It is well known that this problem (and even the problem of computing 
"sharp" bounds on the range of a function of w-variables over an n-dimensional 
box) is an "NP-hard" problem. In general, NP-hard problems require an 
exponentially increasing amount of work to solve as n, the number of independent 
5 variables, increases. 

Because NP-hardness is a worst-case property and because many practical 
engineering and scientific problems have relatively simple structure, one problem 
is to use this simple structure of real problems to improve the efficiency of 
interval inequality constrained global optimization algorithms. 
10 Hence, what is needed is a method and an apparatus for using the structure 

of a nonlinear objective function to improve the efficiency of interval inequality 
constrained global optimization soihvare. To this end, what is needed is a method 
and apparatus that efficiently deletes "large" boxes or parts of large boxes that the 
above criteria can only split. 

15 

SUMMARY 

One embodiment of the present invention provides a system that solves a 
global optimization problem specified by a function / and a set of inequality 
constraints pi{\) < 0 {i^U ••■,m), wherein /is a scalar function of a vector 

20 X (xj, X2, xs, ... x«). The system operates by receiving a representation of the 
function/ and the set of inequality constraints, and then storing the representation 
in a memory within the computer system. Next, the system applies term 
consistency to the set of inequality constraints over a subbox X, and excludes any 
portion of the subbox X that violates any member of the set of inequality 

25 constraints. 

In one embodiment of the present invention, the system additionally 
linearizes the set of inequality constraints to produce a set of linear inequality 
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constraints with interval coefficients. Next, the system preconditions the set of 
linear inequality constraints using additive linear combinations to produce a 
preconditioned set of linear inequality constraints. The system then applies term 
consistency to the set of preconditioned linear inequality constraints over the 
5 subbox X, and excludes any portion of the subbox X that violates set of 

preconditioned linear inequality constraints. In a variation on this embodiment, 
the system additionally keeps track of a least upper homxd fjbar of the function 
f[x) in the feasible region, and includes y(x) <f_bar in the set of inequality 
constraints prior to linearizing the set of inequality constraints. In a variation on 
10 this embodiment, the system removes from consideration any inequality 

. constraints that are not violated by more than a predetermined amount for 

purposes of applying term consistency prior to linearizing the set of inequality 

u constraints. 

^ In one embodiment of the present invention, performing the interval global 

15 optimization process involves keep ing track of a least upper hound fj)ar of the 
fimctiony(x) in the feasible region, and removing from consideration any subbox 
J for which^x) >f_bar. It also involves applying term consistency to the inequality 

fix) < f_bar over the subbox X, and excluding any portion of the subbox X that 
£3 violates the inequality. 

20 In one embodiment of the present invention, if the subbox X is strictly 

feasible (p/(X) < 0 for all z==i, performing the interval global optimization 
process involves determining a gradient g(x) of the function^x), wherein g(x) 
includes components gi{x) .,.,n). Next, the system removes from 
consideration any subbox for which g(x) is bounded away from zero, thereby 
25 indicating that the subbox does not include an unconstrained local extremum. The 
system also applies term consistency to each component gi(x)^0 of 
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g(x)=0 over the subbox X, and excludes any portion of the subbox X that violates 
a component. 

In one embodiment of the present invention, if the subbox X is strictly 
feasible (pi(X) < 0 for all i=l, ...,n), the system performs the interval global 
5 optimization process by determining diagonal elements i/«(x) (i = 7, . . . , n) of the 
Hessian of the function /x), and removing from consideration any subbox for 
which a diagonal element of the Hessian is always negative, indicating that the 
function /is not convex and consequently does not contain a global minimum 
within the subbox. The system also applies term consistency to each inequality 
10 Hn{\) ^ 0 {i^l, „„n) over the subbox X, and excludes any portion of the subbox X 
that violates the inequalities. 

In one embodiment of the present invention, if the subbox X is strictly 
J. feasible (that is p^Qi) < 0 for all /= l...,m), then the system uses the interval 

5 'I Newton method to find a box X that contains a stationary point y where the 

15 gradient of/, g(y) = 0. This involves forming and solving the system of equations 
M(x,X)(y-x) - r(x) for the bounding box Y on y: where M(x,X) = BJ(x,X); 
m ', J(x,X) is the Jacobian of the objective function/expanded about the point x over 

the subbox X; B is the approximate inverse of the center of J(x,X); and 
□ r(x) = -Bg(x). The system also applies term consistency to each equality 

20 (Bg(x))^ = 0 (j^l for each variable Xj (/=7, ...,n) over the subbox X, and 
excludes any portion of the subbox X that violates an equality. 

For any function of w-variables/x) there are different ways to analytically 
express a component Xj of the vector x. For example, one can v^ite 
fix) = g(X;) - h{x), where g(xj) is a term in / for which it is possible to solve 
25 g{xj) = y for an element of x, Xj , using g\y). For each of these rearrangements, if 
a given interval box X is used as an argument of h, then the new interval X/ for 
the y-th component of X, is guaranteed to be at least as narrow as the original, Xj . 
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X/==Xjn X) where X) =g\h{X)). 

This process is then be iterated using different terms g of the function / 
5 After reducing any element Xj of the box X to J^^, the reduced value can be used 
in X thereafter to speed up the reduction process using other component functions 
if /is a component of the vector function f. 

Hereafter, the notation g{Xj) for a term of the function / (x) implicitly 
represents any term of any component fiinction. This eliminates the need for 
10 additional subscripts that do not add clarity to the exposition. 

In one embodiment of the present invention, applying term consistency 
involves symbolically manipulating an equation within the computer system to 
solve for a first term, g{xj\ thereby producing a modified equation g{xj) = /?(x), 
wherein the first term g{xj) can be analytically inverted to produce an inverse 
15 ftinction g"^(y). Next, the system substitutes the subbox X into the modified 

equation to produce the equation g{X)) = h{X) and then solves for = g~\h{Xy). 
The system then intersects X) with the interval Jl^ to produce a new subbox X"^, 
which contains all solutions of the equation within the subbox X, and wherein the 
size of the new subbox X"^ is less than or equal to the size of the subbox X, 
20 In one embodiment of the present invention, the system additionally 

performs the Newton method on the John conditions. 

BRIEF DESCRIPTION OF THE FIGURES 

FIG, 1 illustrates a computer system in accordance with an embodiment of 
25 the present invention. 

FIG, 2 illustrates the process of compiling and using code for interval 
computations in accordance with an embodiment of the present invention. 
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FIG. 3 illustrates an arithmetic unit for interval computations in 
accordance with an embodiment of the present invention. 

FIG. 4 is a flow chart illustrating the process of performing an interval 
computation in accordance with an embodiment of the present invention. 
5 FIG. 5 illustrates four different interval operations in accordance with an 

embodiment of the present invention. 

FIG. 6 is a flow chart illustrating the process of finding an interval solution 
to a nonlinear equation using term consistency in accordance with an embodiment 
of the present invention. 
10 FIG. 7 A presents a portion of a flow chart illustrating the process of using 

term consistency to solve an interval global optimization problem with inequality 
constraints in accordance v^th an embodiment of the present invention. 

FIG. 7B presents a portion of a flow chart illustrating the process of using 
term consistency to solve an interval global optimization problem with inequality 
15 constraints in accordance with an embodiment of the present invention. 

FIG. 7C presents a portion of a flow chart illustrating the process of using 
term consistency to solve an interval global optimization problem with inequality 
constraints in accordance with an embodiment of the present invention. 

FIG. 7D presents a portion of a flow chart illustrating the process of using 
20 term consistency to solve an interval global optimization problem with inequality 
constraints in accordance with an embodiment of the present invention. 



DETAILED DESCRIPTION 

The following description is presented to enable any person skilled in the 
25 art to make and use the invention, and is provided in the context of a particular 
application and its requirements. Various modifications to the disclosed 
embodiments will be readily apparent to those skilled in the art, and the general 
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principles defined herein may be applied to other embodiments and applications 
without departing firom the spirit and scope of the present invention. Thus, the 
present invention is not limited to the embodiments shown, but is to be accorded 
the widest scope consistent with the principles and features disclosed herein. 

5 The data structures and code described in this detailed description are 

typically stored on a computer readable storage medium, which may be any device 
or medium that can store code and/or data for use by a computer system. This 
includes, but is not limited to, magnetic and optical storage devices such as disk 
drives, magnetic tape, CDs (compact discs) and DVDs (digital versatile discs or 

1 0 digital video discs), and computer instruction signals embodied in a transmission 
medium (with or without a carrier wave upon which the signals are modulated). 
For example, the transmission medium may include a communications network, 
such as the Internet. 

15 Computer System 

FIG. 1 illustrates a computer system 100 in accordance with an 
embodiment of the present invention. As illustrated in FIG. 1, computer system 
100 includes processor 102, which is coupled to a memory 1 12 and a to peripheral 
bus 110 through bridge 106. Bridge 106 can generally include any type of 
20 circuitry for coupling components of computer system 100 together. 

Processor 102 can include any type of processor, including, but not limited 
to, a microprocessor, a mainframe computer, a digital signal processor, a personal 
organizer, a device controller and a computational engine within an appliance. 
Processor 102 includes an arithmetic unit 104, which is capable of performing 
25 computational operations using floating-point numbers. 

Processor 102 communicates with storage device 108 through bridge 106 
and peripheral bus 110. Storage device 108 can include any type of non-volatile 

10 

Attorney Docket No. SUN-P6446-SPL Inventors: Walster et al. 

ARPC \MY DOCUMENTS\SUN MICROSYSTEMS\SUN-P6446-SPL\SUN-P6446-SPL APPLICATIONS DOC 



t 



storage device that can be coupled to a computer system. This includes, but is not 
limited to, magnetic, optical, and magneto-optical storage devices, as well as 
storage devices based on flash memiory and/or battery-backed up memory. 
Processor 102 communicates with memory 112 through bridge 106. 
5 Memory 112 can include any type of memory that can store code and data for 
execution by processor 102. As illustrated in FIG. 1, memory 1 12 contains 
computational code for intervals 1 14. Computational code 1 14 contains 
instructions for the interval operations to be performed on individual operands, or 
interval values 115, which are also stored within memory 112. This 
10 computational code 1 14 and these interval values 1 15 are described in more detail 
below with reference to FIGs. 2-5. 

Note that although the present invention is described in the context of computer 
system 100 illustrated in FIG. 1, the present invention can generally operate on 

any type of computing device that can perform computations involving floating- 
1 5 point numbers. Hence, the present invention is not limited to the computer system 
100 illustrated in FIG. 1. 

Compiling and Using Interval Code 

FIG. 2 illustrates the process of compiling and using code for interval 
20 computations in accordance with an embodiment of the present invention. The 
system starts with source code 202, which specifies a number of computational 
operations involving intervals. Source code 202 passes through compiler 204, 
which converts source code 202 into executable code form 206 for interval 
computations. Processor 102 retrieves executable code 206 and uses it to control 
25 the operation of arithmetic unit 1 04 . 



11 

Attorney Docket No. SUN-P6446-SPL Inventors: Walster et al. 

ARPC-\MY DOCUMENTS\SUN MICROS YSTEMS\SUN-P6446-SPL\SUN-P6446-SPL APPLICATIONS DOC 



Processor 102 also retrieves interval values 115 from memory 1 12 and 
passes tliese interval values 115 through arithmetic unit 104 to produce results 
212. Results 212 can also include interval values. 

Note that the term "compilation" as used in this specification is to be 
5 construed broadly to include pre-compilation and just-in-time compilation, as well 
as use of an interpreter that interprets instructions at rim-time. Hence, the term 
"compiler" as used in the specification and the claims refers to pre-compilers, 
just-in-time compilers and interpreters. 



10 Arithmetic Unit for Intervals 

FIG. 3 illustrates arithmetic unit 104 for interval computations in more 
detail accordance with an embodiment of the present invention. Details regarding 
the construction of such an arithmetic unit are well known in the art. For 
example, see U.S. Patent Nos. 5,687,106 and 6,044,454. Arithmetic unit 104 

15 receives intervals 302 and 312 as inputs and produces interval 322 as an output. 
In the embodiment illustrated in FIG. 3, interval 302 includes a first 
floating-point number 304 representing a first endpoint of interval 302, and a 
second floating-point number 306 representing a second endpoint of interval 302. 
Similarly, interval 3 12 includes a first floating-point number 314 representing a 

20 first endpoint of interval 312, and a second floating-point number 3 1 6 

representing a second endpoint of interval 312. Also, the resulting interval 322 
includes a first floating-point number 324 representing a first endpoint of interval 
322, and a second floating-point number 326 representing a second endpoint of 
interval 322. 

25 Note that arithmetic unit 104 includes circuitry for performing the interval 

operations that are outlined in FIG. 5. This circuitry enables the interval 
operations to be performed efficiently. 
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However, note that the present invention can also be appUed to computing 
devices that do not include special-purpose hardware for performing interval 
operations. In such computing devices, compiler 204 converts interval operations 
into a executable code that can be executed using standard computational 

5 hardware that is not specially designed for interval operations. 

FIG. 4 is a flow chart illustrating the process of performing an interval 
computation in accordance with an embodiment of the present invention. The 
system starts by receiving a representation of an interval, such as first floating- 
point number 304 and second floating-point number 306 (step 402). Next, the 

1 0 system performs an arithmetic operation using the representation of the interval to 
produce a result (step 404). The possibilities for this arithmetic operation are 
described in more detail below with reference to FIG. 5. 

Interval Operations 

15 FIG. 5 illustrates four different interval operations in accordance with an 

embodiment of the present invention. These interval operations operate on the 
intervals Zand Y. The interval Jf includes two endpoints, 

X denotes the lower bound of X, and 
X denotes the upper bound of X. 

The interval Z is a closed subset of the extended (including -^o and +o°) real 
20 numbers 7?* (see line 1 of FIG. 5). Similarly the interval 7 also has two endpoints 
and is a closed subset of the extended real numbers 7?* (see line 2 of FIG. 5). 

Note that an interval is a point or degenerate interval if X= [x, x]. Also 
note that the left endpoint of an interior interval is always less than or equal to the 
right endpoint. The set of extended real numbers, 7?* is the set of real numbers, R, 
25 extended with the two ideal points negative infinity and positive infinity: 
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R^ = Ru {-00} u {+00}. 

In the equations that appear in FIG. 5, the up arrows and down aiTOws 
5 indicate the direction of rounding in the next and subsequent operations. Directed 
rounding (up or down) is applied if the result of a floating-point operation is not 
machine-representable. 

The addition operation Z+ 7 adds the left endpoint of Xto the left 
endpoint of 7 and rounds down to the nearest floating-point number to produce a 
10 resulting left endpoint, and adds the right endpoint of Xto the right endpoint of Y 
and rounds up to the nearest floating-point number to produce a resulting right 
endpoint. 

Similarly, the subtraction operation X- 7 subtracts the right endpoint of Y 
from the left endpoint of X and rounds down to produce a resulting left endpoint, 

1 5 and subtracts the left endpoint of Y from the right endpoint of X and rounds up to 
produce a resulting right endpoint. 

The multiplication operation selects the minimum value of four different 
terms (rounded down) to produce the resulting left endpoint. These terms are: the 
left endpoint of X multiplied by the left endpoint of Y; the left endpoint of X 

20 multiplied by the right endpoint of Y; the right endpoint of X multiplied by the left 
endpoint of Y; and the right endpoint of X multiplied by the right endpoint of 7. 
This multiplication operation additionally selects the maximum of the same four 
terms (rounded up) to produce the resulting right endpoint. 

Similarly, the division operation selects the minimum of four different 

25 terms (rounded down) to produce the resulting left endpoint. These terms are: the 
left endpoint of X divided by the left endpoint of 7; the left endpoint of X divided 
by the right endpoint of 7; the right endpoint of X divided by the left endpoint of 
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Y; and the right endpoint of X divided by the right endpoint of Y, This division 
operation additionally selects the maximum of the same four terms (rounded up) 
to produce the resulting right endpoint. For the special case where the interval Y 

includes zero, X/Y is an exterior interval that is nevertheless contained in the 
5 interval i?* 

Note that the resuh of any of these interval operations is the empty interval 
if either of the intervals, Xor Y^ are the empty interval. Also note, that in one 
embodiment of the present invention, extended interval operations never cause 
undefined outcomes, which are referred to as "exceptions" in the IEEE 754 
10 standard. 

Term Consistency 

FIG. 6 is a flow chart illustrating the process of solving a nonlinear 
equation through interval arithmetic and term consistency in accordance with an 

15 embodiment of the present invention. The system starts by receiving a 
representation of a nonlinear equation^^) = 0 (step 602), as well as a 
representation of an initial interval Z (step 604). Next, the system symbolically 
manipulates the equation/x) = 0 into the form g(x >/?(x) = 0 to solve for a term 
g(x ') = h{x\ wherein the term g(x ') can be analytically inverted to produce and 

20 inverse function ^(y)(step 606). 

Next, the system substitutes the initial interval X into h{x) to produce the 
equation g{X') = h(X) (step 608). The system then solves for X' = g'\h(X)) 
(step 610). The resulting interval X' is then intersected with the initial interval X 
to produce a new interval (step 612). 

25 At this point, the system can terminate. Otherwise, the system can 

perform further processing. This further processing involves setting temporarily 
savingXby setting J^^^^ =Xmd by settingX-J^^ (step 614). Next, if w(y^^) has 
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been sufficiently reduced (step 616), the system returns to step 606 for another 
iteration of term consistency on another term g of fix). Otherwise, the process 
terminates. 



Examples of Applying Term Consistency 

For example, suppose/x) - -x + 6. We can define g{x) = and 
h(x)=X'6, Let X= [-10,10]. The procedural step is (Xf =X- d= [46,4]. Since 

(X'f must be non-negative, we replace this interval by [0,4]. Solving foryY', we 
10 obtain X' = ± [0,2]. In replacing the range of h{x) (i.e., [-16,4]) by non-negative 
values, we have excluded that part of the range h(x) that is not in the domain of 
gix) -x^. 

Suppose that we reverse the roles of g and h and use the iterative step 
h(X') = giX). That isX'-6= X\ We obtainX^ = [6,106]. Intersecting this 
1 5 result with the interval [- 1 0, 1 0], of interest, we obtain [6, 1 0] . This interval 

excludes the set of values for which the range of g{X) is not in the intersection of 
the domain of h{X) withX 

Combining these results, we conclude that any solution of 
J{X) - g{X) - h{X) = 0 that occurs inZ== [-10,10] must be in both [-2,2] and 
20 [6,10]. Since these intervals are disjoint, there can be no solution in [-10,10]. 

In practice, if we already reduced the interval from [-10,10] to [-2,2] by 
solving for g, we use the narrower interval as input when solving for h. 

This example illustrates the fact that it can be advantageous to solve a 
given equation for more than one of its terms. The order in which terms are 
25 chosen affects the efficiency. Unfortunately, it is not known how best to choose 
the best order. 
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Also note that there can be many choices for g{x). For example, suppose 
we use term consistency to narrow the interval bounds on a solution of 
fix) == ax^ + 6x + c = 0. We can let g{x) = bx and compute X' = -{aJif + c)/h or 
we can let g{x) = ax"^ and compute X' = ± [-{bX+cyaY^^. We can also separate 
5 into x^ * x' and solve for one of the factors X' ^ ± [-{bX+c)/{aX^)] 

In the multidimensional case, we may solve for a term involving more than 
one variable. We then have a two-stage process. For example, suppose we solve 
for the term l/(x+y) from the functionXx,;;) = l/ix-^-y) - h{x,y) = 0. Let 
xeX= [1,2] and;; e 7= [0.5,2]. Suppose we find that 7)^ [0-5 J]. Then 
10 l/(x-\-y) E [0.5,1] so x+y e [1,2]. Now we replace;/ by 7= [0.5,2] and obtain the 
bound [-1,1.5] onX Intersecting tiiis interval with the given bound X= [1,2] on 
X, we obtain the new bound X' = [ IJ . 5] . 

We can use X' to get a new bound on h; but his may require extensive 
computing if is a complicated function; so suppose we do not. Suppose that we 
15 do, however, use this bound on our intermediate result x + >• = [ 1 ,2] . Solving for y 
as [1,2] -~X\ we obtain the bound [-0.5,1], Intersecting this interval with 7, we 
obtain the new bound 7' = [0.5,1] on;;. Thus, we improve the bounds on both x 
and;; by solving for a single term of/ 

The point of these examples is to show that term consistency can be used 
20 in many ways both alone and in corabination with the interval Newton algorithm 
to improve the efficiency with which roots of a single nonlinear equation can be 
computed. The same is true for systems of nonlinear equations. 

Term Consistency and Inequality Constrained Interval Global Optimization 

25 FIGs, 7A-7D collectively present a flow chart illustrating the process of 

using term consistency in solving an interval global optimization problem with 
inequality constraints in accordance with an embodiment of the present invention. 
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Generally, we seek a solution in a single box specified by the user. However, any 
number of boxes can be initially specified. 

The boxes can be disjoint or overlap. However, if they overlap, a 
minimum at a point that is common to more than one box is separately found as a 
5 solution in each box containing it, ][n this case, computing effort is wasted. If the 
user does not specify an initial box or boxes, we use a default box. The process 
finds the global minimum in the set of points formed by the set of boxes. We 
assume these initial boxes are placed in a list Lj of boxes to be processed. 

Suppose the user of the process knows a point x_bar that is guaranteed to 
10 be feasible. If so, we use this point to compute an initial upper hound f_bar on the 
global minimum/*. If xjbar cannot be represented exactly on the computer, the 
system forms a representable interval vector X containing x bar. We evaluate 
f{X) and obtain [lower bound XX) ,upper bound /(X)]. Even if rounding and/or 
I dependence are such that X cannot be numerically proven to be certainly feasible, 

2 15 we rely upon the user and assume that X contains a feasible point. Therefore, we 

Kiss: 

set fjbar equal to the upper bound of/X). 
fy Also the user may know an ijipper hound fjbar on/* even though he may 

f ^ not know where (or even if) / takes on such a value in the feasible region defined 

by the inequality constraints. If so, we set fj?ar equal to this known bound. If the 

I 8 

20 known bound is not representable on the computer, the system rounds the value 

up to a larger value that is representable. 

If no feasible point is known and no upper bound on/* is known, we set 

fJbar = +«>. We assume the user has specified a box size tolerance ex and a 

function width tolerance Sf. 
25 Since the process may use a procedure for finding an unconstrained 

minimum, the system provides the parameters needed therein. Therefore, the 

system specifies wr and wj . In doing so, it sets If a single box X^^^ is 
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given, the system sets w/ = w{X^\ wherein >v(X^ 0 is the maximum of the widths 
of all elements in the vector X^^\ If more than one box is given, the system sets w/ 
equal to the width of the largest one. Also, the system sets = 0. 

Thus, to initialize our process, the system specifies ex, ^f, wr, wj, and the 
5 initial box(es). Li addition, the system specifies a hound fjbar if one is known. 

The steps of the process are to be performed in the order given except as 
indicated by branching. 

First, for each box in the list I/, the system applies term consistency to 
each of the inequality constraints pi(x) < 0 {i^l ...,m) (step 701). 
1 0 lffjbar<+oo, then for each box in , the system applies term consistency 

to the inequality f<fj)ar (step 702). 

If is empty, the system goes to step 742. Otherwise, the system selects 
(for the next box X to be processed) the box in 1/ for which the lower bound of 
XX) is smallest. For later reference;, the system denotes this box by X^^\ The 
1 5 system also deletes X from Li (step 703). 

The system applies term consistency over X to each constraint inequality. 
If X is deleted, the system goes to step 703. The system skips this step if X has 
not changed since step 701. (step 704). 

Next, the system computes an approximation x for the center m(X) of X. 
20 If the upper bound ofy(x)>/_6ar, the system goes to step 708 (step 705). 

For future reference, the system denotes the box X by X^^\ Next, the 
system does a line search to try to reduce (step 706). 

If fjbar was not reduced in step 706, the system goes to step 709 
(step 707). 

25 Next, the system applies teim consistency to the inequality /x) ^ fjbar. If 

X is deleted, the system goes to step 703 (step 708). 
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< 



lfw(X)<ex and w[f{X)]< Sp, the system puts X in list i^. Otherwise, if X 
is sufficiently reduced relative to the box X^^^ defined in step 703, the system puts 
X in 1/ and goes to step 703 (step 709). We say that a box X is sufficiently 
reduced if any component of X is reduced by an amount that is at least a fraction 
5 (say 0,25) of the width of the widest component of X. 

Next, the system applies box consistency to each inequality constraint. If 
fj)ar <+oo, the system also applies box consistency to the inequality y(x):^Z)ar. 
If X is deleted, the system goes to step 703 (step 710). 

If the upper bound of pi(S)>0 for any i^I,.,.,n, (i.e., if X is not certainly 
1 0 strictly feasible), the system goes to step 726 (step 711). 

Next, the system applies term consistency to g,(x)=0 for /=7,...,«, where g 
is the gradient of the objective function / If the resuh for any is empty, 

the system goes to step 703 (step 712). Note that the steps 712 through 725 do not 
use inequality constraints because none are active for the current box X. 
1 5 Otherwise, the system applies term consistency to the relation //„(x)^ 0 for 

?=l,...,w, where //„ is a diagonal element of the Hessian of f. If the result is empty, 
the system goes to step 703 (step 713). 

Next, the system repeats step 709 (step 714), 

The system then applies box consistency to gt^O for /=!,.. If the result 
20 is empty, the system goes to step 703 (step 715). 

Next, the system applies box consistency to //„(x)>0 for z=7,.,.,«. If the 
result is empty, the system goes to step 703 (step 716). 
Next, the system repeats step 709 (step 717). 

The system then uses a criterion w(X) > (wj+wr)/! to decide if a Newton 
25 step should be applied to solve g=0. If not, the system goes to step 726 
(step 718). Note that, wi denotes the width of the smallest box for which 
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= BJ(x,X) is irregular. If is regular for a given box, wr denotes the width 
of the largest box for which is regular. 

The system generates the interval Jacobian J(x,X) of the gradient g and 
computes the approximate inverse B of the center of J(x,X). The system also 
5 applies one step of an interval New1,on method to solve g=0. If the result is 
empty, the system goes to step 703 (step 719). 

Next, the system repeats step 709 (step 720). 
The system then uses the matrix B found in step 719 to obtain Bg in 
analytic form. The system applies term consistency to solve the /-th equation of 
10 Bg=0 for the z-th variable Xi for i=l,.,.,n. If the result is empty, the system goes to 
step 703 (step 721). 

Next, the system repeats step 709 (step 722), 
The system uses box consistency to solve the Hh equation of Bg (as 
,^:J obtained in step 721) for the z-th variable for i=l,...,n. If the result is empty, the 

''4 15 system goes to step 703 (step 723). 
^; Next, the system repeats step 709 (step 724). 

The system uses the matrix B found in step 719 in a line search method to 
H try to reduce the upper hound f bar (step 725). A line search can be performed as 

p follows. Suppose we evaluate the gradient g(x) off[\) at a point x. Note that/ 

^ 20 decreases (locally) in the negative gradient direction from x. A simple procedure 
for finding a point where /is small is to search along this half-line. Let x be the 
center of the current box. Define the half-line of points y{a)=X'ag{x) where a>0. 
We now use a standard procedure for finding an approximate minimum of the 
objective function / on this half-line. We first restrict our region of search by 
25 determining the value a' such that y(a')=X'a'g is on the boundary of the current 
box X, and search between x and x '^^X^O- We use the following procedure. 
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Each application of the procedure requires an evaluation of/ Procedure: If 
J(x ')<fix), replace x by (x+x y2 Otherwise, we replace x' by (x+x')/2. 

Next, the system computes an approximation x for the center m(X) of X. 
If J{x)>f_bar, the system goes to step 703 (step 726). 
5 The system skips this step and goes to step 732 if X = X^^\ the same box 

for which a line search was done in step 706. Otherwise, the system does a line 
search to try to reduce /_6ar. If f_bar is not reduced, the system goes to step 732 
(step 727). 

For future reference, the system denotes X by X^^\ The system then uses a 
10 linearization test to decide whether to linearize and "solve" the inequality 
J{x)<fJ)ar. If this condition is not satisfied, the system goes to step 732 
(step 728). 

The system uses a linear method to try to reduce X using the inequality 
J{\)<f_bar. If X is deleted, the system goes to step 703. Otherwise, if this 
1 5 application of the linear method does not sufficiently reduce the box X^^\ the 
system goes to step 731 (step 729). 

The system uses a quadratic method to try to reduce X using the inequality 
f(x)<f_bar. If X is deleted, the system goes to step 703 (step 730). 

Next, the system repeats step 709 (step 731), 
20 The system uses a criterion similar to that in step 71 8 to decide whether to 

linearize and "solve" the inequality constraints. If the procedure indicates that the 
linearization should not be done, the system goes to 739 (step 732). 

Next, the system selects the Inequality constraints to be solved in 
hnearized form, and possibly adds to this set the imqudlityj{\)<f_bar. Note that 
25 the selection process removes from consideration any inequality constraints that 
are not sufficiently violated. If no inequalities are selected, the system goes to 
step 739. Otherwise, the system linearizes the resulting set of inequalities, and 
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solves the resulting set of linear inequalities. If the solution set is empty, the 
system goes to step 703 (step 733). 

Next, the system repeats step 709 (step 734). 

The system then uses the preconditioning matrix B formed at step 733 to 
5 analytically precondition the set of inequalities that were selected for use in step 
733. The system also uses term consistency to solve each of the preconditioned 
inequalities. In so doing, each inequality is solved for the same (single) variable 
for which the linearized inequality v/as solved in step 733 (step 735). 
Next, the system repeats step 709 (step 736). 
10 The system uses box consistency to solve the same inequalities for the 

same variables as in step 735 (step 737). 

Next, the system repeats step 709 (step 738). 

The system uses the linearization test to decide whether to solve the John 
conditions. If not, the system goes to step 742 (step 739). 
15 The system modifies the John conditions by omitting those constraints pi 

for which pi (X)<0 (since they are not active in X). The system applies one pass 
of the interval Newton method to the (modified) John conditions. If the result is 
empty, the system goes to step 703 (step 740). 

Next, the system repeats step 709 (step 741). 
20 In various previous steps, gaps may have been generated in components of 

X. If so, the system merges any of these gaps that overlap. The system then splits 
X, and places the resulting subboxes in Lj and goes to step 703 (step 742). 

If f_bar<+oo^ the system applies term consistency to fix) <f_bar for each 
box in the list I^. The system denotes those that remain by X^^^.-, X^^^ where s is 
25 the number of boxes remaining. The system also determines 
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^ = min/(^'^) ^rid F = inax/(^'^) (step 743). 



Finally, the system terminates (step 744). 

After termination, w(K)<€x and w{f{X))< Ep for each remaining box X in 
5 the list I2. Also, 

F<m<F 

for every point x in all remaining boxes. If, after termination, /^Z?ar < +<», we 
1 0 know there is a feasible point in the initial box(es). Therefore, we know that 



F</*<min{/,4 



If, after termination, ^ +00^ then we have not found a certainly 
15 feasible point There may or may not be one in X^^\ However, we know that if a 
feasible point x does exist in X^^\ then 

F<m<F. 

20 Suppose a feasible point exists. If our algorithm fails to find a certainly 

feasible point, then it does not produce an upper bound /_Z?ar and cannot use the 
relation / <fj)ar. In particular, it cannot delete local minima where 7tx)>/*. In 
this case, all local minima are contained in the set of output boxes. 

If all of the initial box X^^^ is deleted by our process, then we have proved 

25 that every point in X^^^ is infeasible. Suppose that every point in X^^^ is infeasible. 
Our process may prove this to be the case. However, we delete a subbox of X^^^ 
only if it is certainly infeasible. Rounding errors and/or dependence may prevent 
us from proving certain infeasibility of an infeasible subbox. Increased 
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wordlength can reduce rounding errors and decreasing Sx can reduce the effect of 
dependence by causing subboxes to eventually become smaller. However, neither 
effect can completely be removed. 

Suppose = +00 after termination and X^^^ has not been entirely 
eliminated. It may still be possible either to compute /^6ar < or to delete all of 
X^^^ by reducing the values of Sx and 8f and continuing to apply the process. To 
try to do so^ we need only to reduce these tolerances and move the boxes from list 
1,2 to list L}, We can then restart the algorithm from the beginning with or without 
use of increased precision. 

Note that steps 712 through 725 are essentially the same as corresponding 
steps in the process for unconstrained optimization. This is because these steps are 
applied to a box that is certainly feasible. 

In our process, we avoid using more complicated procedures until the 
simpler ones no longer make sufficient progress in reducing the current box. For 
example, we delay use of the John conditions until all other procedures become 
unproductive. 

We avoid using procedures that use Taylor expansions until we have 
evidence that expanded forms provide sufficiently accurate approximations to 

functions. 

Inequality constraints are often simple relations of the form Xj<bi or Xj>ai 
Such constraints serve to determine the initial box Therefore, they are 
satisfied throughout X^^l Such constraints are omitted when applying any 
procedure designed to eliminate infeasible points. See steps 701, 704, 710 and 
733, 

In step 706 we use a line search to try to reduce /^/?^r. This involves 
evaluating the gradient of / We can avoid this evaluation by simply checking if 
the midpoint x of the box is feasible and, if so, using^x) as a candidate value for 
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X 



f_bar. However, it helps to have a finite value of fjbar early, so the line search is 
worth doing when7l6ar=+«>. Step 727 also uses a line search. It is less important 
here because a finite value of fbar is likely to be computed in step 706. If there 
are a large number of constraints, then evaluating the gradient is not a dominant 
5 part of the work to do the line search. 

Experience has shown that efficiency is enhanced if the subbox X to be 
processed is chosen to be the one for which infiJ^S)) is smallest among all 
candidate subboxes. This tends to cause a smaller value of fJbar to be computed 
early in the algorithm. Therefore, we return to step 703 to choose a new subbox 
10 whenever the current box has substantially changed. 

Suppose we find that pi(X)<0 for some value of and some box X. Then 
plX ')<0 for any X tX. Therefore, we record the fact that Pi{X)<0 so that we 
need not evaluate ^^(X ') for any X c X. 

It is possible that the procedures in step 719, 721, 723 or 740 prove the 
15 existence of a solution to the optimization problem. If so, the user can be 
informed of this fact. Such a solution may be local or global. 

The foregoing descriptions of embodiments of the present invention have 
been presented only for purposes of illustration and description. They are not 
intended to be exhaustive or to limit the present invention to the forms disclosed. 
20 Accordingly, many modifications and variations will be apparent to practitioners 
skilled in the art. Additionally, the above disclosure is not intended to limit the 
present invention. The scope of the present invention is defined by the appended 
claims. 
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